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DETAILED ACTION 
Claim Rejections - 35 USC § 103 

1. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

2. Claims 1-5,7-9,12-16,20,21,23-25 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Ashby, III et al. (5,852,803) in view of Walters et al. (6,453,281) in further 
view of Dragosh et al (6366886) in further view of Ronglev (5758322). 

The US patent of Ashby et al. discloses a computer-based system and thus, 
necessarily includes the apparatus, the method of operation and the article of 
manufacture of this system. Consequently, rejections for claims 1-1 1, 12-20, 21-24 and 
25 have been combined, as they all address the same subject matter. Where 
appropriate, all interrelated limitations from these claims have been listed in the same 
location. 

Referring to claims 1, 12, 21, 25, Ashby, 111 et al. disclose: 
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inputting at least a portion of the electronically-readable identifier marked on an 
item (bar code and bar-code reader (1st device) col.2, In. 1-5 col.5, 

inputting from a user a spoken utterance that corresponds to the item (voice 
information and microphone 2nd device) col.l , ln.63 - col.2, In.l and col.5, In .33-35)., 

associating the electronically-readable identifier (bar-code label) input from 
the item with the spoken utterance input from the user on storage device (81, FIG. 1 1) by 
use of processor (78, FIG. 1 1) (See also col.3, ln.36-39., col.5, In. 18-34 and col.9, 
ln.60-65); 

and outputting the spoken utterance when the electronically-readable 
identifier associated with the spoken utterance is subsequently inputted 
(speaker, 36, FIG. 1 1 and col.3, ln.36-52 and col.9, In.31-47). 

Ashby, III et al, do not disclose 

"converting the spoken utterance input by the user to text" and 
"associating electronically-readable identifier and the corresponding text. 
Note: While Ashby et al. suggest the use of voice recordings instead of text data 
(Col. 9, lines 48-57), Ashby et al.'s do not explicitly teach away from the converting 
user-spoken utterances to text. Ashby et al. only suggest that using voice recordings 
instead of synthesized data would improve user's experience. However, from the 
functional perspective, Ashby et al.'s system would achieve the same goals of delivering audible 
information to the user if it were to synthesize this information from the text. In addition, as it 
would have been obvious to one of the ordinary skill in the art, Ashby et al. f s system suffers 
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from the inefficiencies associated with a storage and maintenance of a large number of voice 
recordings. 

Walters et al. teach converting user's voice input to text (Col. 9, lines 41-50) for 
the purpose of recording user's voice records. As it is well-known in the art, text version 
of a document requires far less storage space than the voice-recorded version of the 
same document. In addition, it is notoriously known in the art of software engineering 
that management of text data (database storing, lookups, searches, etc.) is much 
simpler than the similar management of binary data, such as voice recordings (Col. 1, 
lines 42-55). Walter et al.'s system thus solves the problem of manipulation and 
maintenance of large number of voice records - the problems also associated with 
Ashby et al.'s system, since Ashby's system operates with a large number of voice 
recordings. 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify the method of Ashby, III et al. as taught by 
Walters et al. to convert voice utterances to text because it would advantageously allow the user 
to be in command of a large amount of information with greater efficiency (Walters et al, col. 1 , 
Lines 42-55). In addition, examiner takes the official notice that it is extremely well-known in 
the art of software engineering to perform searches of the databases using text data, 
since it is the fastest and most efficient way which is currently available in the art of 
organizing and searching the databases. 
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The combination of Ashby III et al in view of Walters does not explicitly teach using a 
remote speech-to-text (speech recognition) system. 

However, neither Ashby et al. nor Walter et al. explicitly teach against connecting 
to a network. For example, Walter et al. teaches connecting to a remote database (Col. 
28, lines 40-45), while Ashby et al also teach using a terminal workstation (74, FIG. 1 1) 
which would suggest a network connection in the modern days (at least after 1996, 
while Ashby et al's patent has been filed in 1992, when network connectivity was not as 
prevalent) 

Dragosh et al (6366886), however, teaches a method, wherein the speech-to-text 
conversion is performed on a remote computing device (Fig, 7, the ASR client streams speech 
features over a packet network to a remote ASR server, wherein the ASR server performs speech 
recognition and returns text data back to the ASR client). 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify the combination of Ashby, III et al. in view of 
Walters et al. as taught by Dragosh et al (6366886) to have speech-to-text conversion performed 
on a computing device remotely located with respect to a computing system performing the other 
steps,in order to provide an efficient centralized way to transmit speech data and to facilitate 
speech processing of data in the speech recognition system of Walters et al. because, 
it would allow for local control over speech parameters, yet sending the bulk speech recognition 
process to a remote location (Dragosh et al (6366886), col. 2 lines 15-30) . 

The combination of Ashbv, III et al. (5,852,803) in view of Walters et al (6,453,281) in 
further view of Dragosh et al (6366886) does not explicitly teach having a remote accessabily 
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removable storage medium that contains identifeier and text converted utterances representative 
of users which can be accessed by users in a future time and place, however, Ronglev (5758322) 
teaches the portability of stored voice file information based upon the user's preference (col. 1 1 
lines 30-52). Therefore, it would have been obvious to one of ordinary skill in the art of remote 
speech signal processing systems to modify the combination of Ashby, III et al. (5,852,803) in 
view of Walters et al. (6,453,281) in further view of Dragosh et al (6366886) with remote 
accesibile voice file user information as taught by Ronglev (5758322) because it would 
advantageously reduce the possible errors of misrecognition (col. 1 1 lines 42-46). 

Referring to claims 2, 13, Ashby, 1 1 1 et al. further disclose the method, 
wherein the electronically-readable identifier marked on the item is a universal 
product code (col.9, ln.31-34). 

Referring to claims 3,14, Ashby, 1 1 1 et al. further disclose the method of 
reading the universal product code from each item with a barcode reader (col.9, 
ln.32-36, col. 10, In. 21-31 and Fig. 1 , element #80). 

Referring to claims 4, 15, Ashby, 1 1 1 et al. further disclose the method, 
wherein the associating step comprises storing the spoken utterance corresponding 
to each item using the universal product code read by the barcode reader for each 
item as an index (memory, col.9, ln.31-47 and col.ll, In. 1-3). 
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Referring to claims 5,16, Ashby, 1 1 1 et al. further disclose the method, wherein 
the outputting step comprises: subsequently reading the universal product code from 
an item with the barcode reader (col.9, ln.31-34); searching stored spoken utterances 
using the universal product code as an index (col. 10, ln.66 - col. 1 1, ln.3) ; and 
playing back the spoken utterance that is found in the search to the user (col. 11, 
In .3-7). 

Referring to claim 7, Ashby, III et al. do not disclose the step of converting 
spoken utterance to text by a speech recognition system. 

However, Walters et al. teach using speech recognition system to convert 
spoken utterance to text. (154, Fig. 4 and Col. 9, lines 25-40) 
Therefore, it would have been obvious to one having ordinary skill in the art at the 
time the invention was made to modify the method of Ashby, III et al. as taught by 
Walters et al. to convert the spoken utterance to text by using a speech recognition 
system in order simplify the process of storing, maintaining and retrieving records 
(either synthesized or previously recorded) associated with the electronically-readable 
identifiers (Col. 1, lines 42-55) 

Referring to claims 8-9, 23, Ashby, III et al. do not disclose converting text 
back to speech when the system determines that electronically-readable identifier input 
from the item is associated with the corresponding text. 

However, Ashby et al. do teach outputting voice recordings when an associated 
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electronic code is entered in the system (col.3, ln.36-52 and col.9, ln.31-47). As it has 
been explained in the rejection for claim 1, it would have been obvious to one of the 
ordinary skill in the art to modify Ashby et al. to convert voice utterances to text and 
associate this text with the corresponding electronic labels for the easier storage and 
improved operational efficiency. 

In addition, Walters et al. teach converting text back to speech using text-to- 
speech system for the playback of stored records (156, FIG. 4 and Col. 10, lines 18-27) 
As a result, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify the method of Ashby, 1 1 1 et al. as taught by 
Walters et al. to play back stored records associated with the electronic identifiers 
using text-to-speech conversion, in order simplify the process of retrieving data records 
originally stored as text and associated with the electronically-readable identifiers (Col. 
1, lines 42-55), particularly because, as it is well-known, text recordings do not require 
as much storage capacity as binary voice recordings. In addition, outputting audio data 
allows users to browse/listen to records by audio means only, hence, improving the 
overall user experience for people incapable or disinclined to read product labels 
(Ashby et al., Col. 1, lines 42-51) 

Referring to claim 24, Ashby, 1 1 1 et al. further disclose apparatus configured to 
be user-portable (fig. 1 1, top element #74). 
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Response to Arguments 

3. Applicant's arguments filed 6/8/2006 have been considered but are moot in view of the 
new grounds rejection. The Rongley reference has been introduced to address the newly 
amended claim limitations. 

Conclusion 

4. Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 

5. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Michael Opsasnick, telephone number (571)272-7623, 
who is available Tuesday-Thursday, 9am-4pm. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Mr. Richemond Dorvil, can be reached at (571)272-7602. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 
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Michael Jsf. Opsasnick 
Examiner 
Art Unit 2655 



